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Information retrieval models: Detecting similar documents using salient terms 
James W. Cooper, Anni R. Coden, Eric W. Brown 

November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: " gpdfd 80.68 KB) 



We describe a system for rapidly determining document similarity among a set of 
documents obtained from an Information retrieval (IR) system. We obtain a ranked list of 
the most important terms In each document using a rapid phrase recognizer system. We 
store these in a database and compute document similarity using a simple database query. 
If the number of terms found to not be contained In both documents is less than some 
predetermined threshold compared to the total number of terms in the doc ... 



Keywords: databases, document similarity, duplicate documents, shingles, text mining 



Managing electronic interchange of business documents 
Snehamay Banerjee, Ram L Kumar 
July 2002 Communications of the ACM, Volume 45 issue 7 
Publisher: ACM Press 

Full text available: ffi pdfd 12.37 KB) Additional Information: full citation , abstract , references , citings , index 
[g html(30.21 KB) terms 

Establishing a framework for controlled growth of electronic document interchange, based 
on a diverse set of technological, organizational, and interorganizational factors. 

3 Information integration with attribution support for corporate profiles 
^ Thomas Lee, Melanie Chams, Robert Nado, Michael Siegel, Stuart Madnick 
^ November 1999 Proceedings of the eighth international conference on Information 
and Icnowledge management 
Publisher: ACM Press 

Full text available: ^ pdf(845.25 KB) Additional Information: full citation , abstract , references , index terms 

The proliferation of electronically available data within large organizations as well as 
publicly available data (e.g. over the World Wide Web) poses challenges for users who 
wish to efficiently interact with and integrate multiple heterogeneous sources. This paper 
presents CI3, a corporate information integrator, which applies XML as a tool to facilitate 
data mediation and integration amongst heterogeneous sources in the context of financial 
analysts creating corporate ... 
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4 The view from the trenches: issues in the ontology of restricted domains 
David D. McDonald 

October 2001 Proceedings of the international conference on Formal Ontology in 
Information Systems - Volume 2001 

Publisher: ACM Press 



Full text available: ^ pdfn.34MB^ 



Additional Information: full citation , abstract , references , index terms . 
review 



We consider the impact of strict processing limitations on the design of an ontology for 
information extraction from newswire texts. We conclude that requiring online, real-time 
processing leads to a particular set of answers to fundamental issues of relation size and 
the choice of primary categories. We show how to satisfy these requirements by using 
relational categories directly during analysis and by using a reified lattice of their partial 
saturations that is annotated with the ling ... 

Keywords: financial reports, information extraction, ontology design, type-raising 



5 Using leading text for news summaries: evaluation results and implications for \ 
commercial summarization applications 

Mark Wasson 

August 1998 Proceedings of the 17th international conference on Computational 
linguistics - Volume 2 , Proceedings of the 36th annual meeting on 
Association for Computational Linguistics - Volume 2 

Publisher: Association for Computational Linguistics . Association for Computational Linguistics 

pdf(442.07 KB) 

^Additional Information: full citation , abstract, references 
Publislner Site 

Leading text extracts created to support some online Boolean retrieval goals are evaluated 
for their acceptability as news document summaries. Results are presented and discussed 
from the perspective of commercial summarization technology needs. 

6 XSLT powers a new wave of web applications [ 
Cameron Laird 

March 2002 Linux Journal volume 2002 issue 95 
Publisher: Specialized Systems Consultants. Inc. 

Full text available: g| htmin7.61 KB) Additional Information: full citation, abstract , index terms 
Cameron explains the mysteries of XSLT and its multiple uses. 

7 A network-based research support facility and data archive | 
Richard C. Roistacher, Barbara B. Noble 
October 1976 Proceedings of the annual conference 

Publisher: ACM Press 

Full text available- fiQ Ddf(447.76 KB) Additional Information: full citation , abstract, references , citings, index 
terms 

A data archive and research support facility for researchers and planners In criminal justice 
provides services over a nationwide data communication network. The use of a networl< 
allows many services not otherwise possible, such as use of shared archival data files and 
common software. Clients are encouraged to form working groups which share experiences 
and resources. Additional problems encountered in managing a network-based facility are 
discussed. 

The scent of a site: a svstem for analyzing and predicting information scent, usage. | 
and usability of a Web site 
Ed H. Chi, Peter Pirolli, James Pitkow 
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April 2000 Proceedings of the SIGCHI conference on Human factors in computing 

systems 
Publisher: ACM Press 

Full text available: fg|Ddfn.29MB^ Additional Information: full citation , abstract, references , citings, index 

terms 

Designers and researchers of users' interactions with the World Wide Web need tools that 
permit the rapid exploration of hypotheses about complex interactions of user goals, user 
behaviors, and Web site designs. We present an architecture and system for the analysis 
and prediction of user behavior and Web site usability. The system integrates research on 
human information foraging theory, a reference model of information visualization and 
Web data-mining techniques. The system also incorporat ... 

Keywords: World Wide Web, data mining, dome tree, information foraging, information 
scent, information visualization, longest repeated subsequences, usability, usage-based 
layout 



9 Pad: an alternative approach to the computer interface 
Ken Perlin, David Fox 

September 1993 Proceedings of the 20th annual conference on Computer graphics 

and interactive techniques 
Publisher: ACM Press 

Full text available: ^ pdf(234.36 KB) Additional Information: full citation , references , citings , index terms 




10 Risks to the public B 
p. G. Neumann 

October 1990 ACi^ SIGSOFT Software Engineering Notes, Volume 15 issue 5 
Publisher: ACM Press 

Full text available: ^pdf(1.56 MB) Additional Information: full citation, index terms 
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1 Short papers: personal assistants: Context-based similar words detection and its | 
^ application in specialized search engines 
^ Hisham AhMubald, Ping Chen 

January 2005 Proceedings of the 10th international conference on Intelligent user 

interfaces 
Publisher: ACM Press 

Full text available: ^ pdf(67.09 KB) Additional Information: full citation , abstract , references , index terms 

This paper presents a new context-based method for automatic detection and extraction of 
similar and related words from texts. Finding similar words is a very important task for 
many NLP applications including anaphora resolution, document retrieval, text 
segmentation, and text summarization. Here we use word similarity to improve search 
quality for search engines in (general and) specific domains. Our method Is based on rules 
for extracting the words in the neighborhood of a target word, then c ... 



Keywords: information retrieval, word-similarity detection 



2 Information integration with attribution support for corporate profiles 
Thomas Lee, Melanie Chams, Robert Nado, Michael Siegel, Stuart Madnick 
November 1999 Proceedings of the eighth international conference on Information 

and knowledge management 
Publisher: ACM Press 

Full text available: ^ pdf(845.25 KB) Additional Information: full citation , abstract , references , index terms 

The proliferation of electronically available data within large organizations as well as 
publicly available data (e.g. over the World Wide Web) poses challenges for users who 
wish to efficiently interact with and integrate multiple heterogeneous sources. This paper 
presents CIS, a corporate information integrator, which applies XML as a tool to facilitate 
data mediation and integration amongst heterogeneous sources in the context of financial 
analysts creating corporate ... 

Keywords: XML, attribution, data integration, data mediation, metadata 





3 Public access Web information systems: lessons from the Internet EDGAR project 
Ajit Kambil, Mark Ginsburg 

July 1998 Communications of the ACM, Volume 4i issue 7 
Publisher: ACM Press 

Full text available: ^ Pdf(226.49 KB^ Additional Information: full citation , references, citings, index terms 
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^. "Constant, constant, multi-tasking craziness": managing multiple working spheres 




Victor M. Gonzalez, Gloria Mark 

April 2004 Proceedings of the SIGCHI conference on Human factors in computing 



systems 
Publisher: ACM Press 

Full text available: l!gl pdf(503 16KB) Additional Information: full citation , abstract , references, citings, index 
'^^'"^ '' terms 

Most current designs of information technology are based on the notion of supporting 
distinct tasks such as document production, email usage, and voice connmunication. In this 
paper we present empirical results that suggest that people organize their work in terms of 
much larger and thematically connected units of work. We present results of fieldwork 
observation of information workers in three different roles: analysts, software developers, 
and managers. We discovered that all of these types of ... 

Keywords: attention management, empirical study, information overload, interruptions, 
personal information management 



Automatically locating, extracting and analyzing tabular data 
William Kornfeld, John Wattecamps 

August 1998 Proceedings of the 21st annual international ACM SIGIR conference on 
Research and development in information retrieval 

Publisher: ACM Press 

Full text available: ^pdf(297.71 KB) Additional Information: full citation , references , index terms 



r/me/y financial reporting at corporate web sites? 
Michael Ettredge, Vernon J. Richardson, Susan Scholz 
June 2002 Communications of the ACM, Volume 45 issue 6 
Publisher: ACM Press 

Full text available: ^ pdf(129.49 KB) , u * * * * 

iSZ. ./^o .xr^s Additional Information: full citation , abstract , references , index terms 

htmi(23.23 KB) 

Considering the competitive advantage of having current financial information available 
online, it's curious how few institutions actually succeed in this regard. 

Fast supervised dimensionality reduction algorithm with applications to document 

categ orization & retrieval 

George Karypis, Eui-Hong (Sam) Han 

November 2000 Proceedings of the ninth international conference on Information and 

knowledge management 
Publisher: ACM Press 

Full text available: pdf(270.71 KB) Additional Information: full citation , references , citings, index terms 



8 Database mining challenges for digital libraries B 
Robert Grossman 

December 1996 ACM Computing Surveys (CSUR) 
Publisher: ACM Press 

Full text available: g) html(10.82 KB) Additional Information: full citation 



9 The external auditor's review of computer controls 
^ Charles R. Litecky, Larry E. Rittenberg 

May 1981 Communications of the ACM, volume 24 issue 5 
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Publisher: ACM Press 

Full text available: ^pdf(814.82 KB) Additional Information: full citation , abstract , references , index terms 

The Foreign Corrupt Practices Act of 1977, coupled with growing dennands for corporate 
accountability, have forced both auditors and connputer administrators to evaluate 
computer based controls. Computer administrators can benefit from both a knowledge of 
an auditor's approaches to evaluating controls and his/her recommendations for control 
Improvements. Here, a survey of the control evaluation practices and desirable control 
features Identified by computer auditors is presented, along with r ... 

Keywords: EDP audit. Foreign Corrupt Practices Act, computer control, computer 
management 



10 Industry session 1: information retrieval: XML parsing: a threat to database 
^ performance 
^ Matthias Nicola, Jasmi John 

November 2003 Proceedings of the twelfth international conference on Information 
and knowledge management 

Publisher: ACM Press 

Full text available- ISl Ddf(21 0 86 KB) Additional Infomiation: full citation , abstract , references , citings, index 
. i^2H^_a : terms 

XML parsing is generally known to have poor performance characteristics relative to 
transactional database processing. Yet, its potentially fatal impact on overall database 
performance is being underestimated. We report real-word database applications where 
XML parsing performance is a key obstacle to a successful XML deployment. There is a 
considerable share of XML database applications which are prone to fail at an early and 
simple road block: XML parsing. We analyze XML parsing performance an ... 

Keywords: DOM, SAX, XML, database, parser, performance, validation 



A comparison of rankings produced by summarization evaluation measures 
Robert L. Donaway, Kevin W. Drummey, Laura A. Mather 

April 2000 NAACL-ANLP 2000 Workshop on Automatic summarization - Volume 4 
Publisher: Association for Computational Linguistics 

Full text available: ^pdf(1.03 MB) Additional Information: full citation , abstract , references 

Summary evaluation measures produce a ranking of all possible extract summaries of a 
document. Recall-based evaluation measures, which depend on costly human-generated 
ground truth summaries, produce uncorreiated rankings when ground truth Is varied. This 
paper proposes using sentence-rank-based and content-based measures for evaluating 
extract summaries, and compares these with recall-based evaluation measures. Content- 
based measures increase the correlation of rankings induced by synonymous gr ... 



12 Information handling: Mobile phones and paper documents: evaluating a new 
approach for capturing microfinance data in rural India 
Tapan S. Parikh, Paul Javid, Sasikumar K., Kaushik Ghosh, Kentaro Toyama 
April 2006 Proceedings of the SIGCHI conference on Human Factors in computing 

systems CHI '06 
Publisher: ACM Press 

Full text available: ^pdf(781.51 KB) Additional Information: full citation , abstract , references , index terms 

CAM is a user Interface toolkit that allows a camera-equipped mobile phone to interact 
with paper documents. It is designed to automate inefficient, paper-intensive information 
processes in the developing world. In this paper we present a usability evaluation of an 
application built using CAM for collecting data from microfinance groups in rural India. This 
application serves an important and immediate need In the microfinance industry. Our 
quantitative results show that the user interface is ef ... 
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Keywords: ICT, document processing, microfinance, mobile phone, paper user Interface, 
rural development, visual codes 



13 IR-3 (information retrieval): fusion of retrieval systems: A multi-svstem analysis of 
document and term selection for blind feedback 
Thomas R. Lynam, Chris Buckley, Charles L. A. Clarke, Gordon V. Cormack 
November 2004 Proceedings of the thirteenth ACM international conference on 

Information and knowledge management CIKM '04 
Publisher: ACM Press 

Full text available: ^ Ddf(483.55 KB) Additional Information: full citation , abstract , references , index terms 

Experiments were conducted to explore the impact of combining various components of 
eight leading information retrieval systems. Each system demonstrated improved 
effectiveness with the use of <i>blind feedback</i>, in which the results of a preliminary 
retrieval step were used to augment the efficacy of a secondary retrieval step. The hybrid 
combination of primary and secondary retrieval steps from different systems in a number 
of cases yielded better effectiveness than either of the ... 

Keywords: blind feedback, fusion, pseudo-relevance feedback 



Curriculum recomnnendations for graduate professional programs in information 
^ systems 

^ May 1972 Communications of the ACM, Volume 15 issue s 
Publisher: ACM Press 

Full text available: ^ pdf(4.00 MB) Additional Information: full citation , references , citings 



Keywords: education, information analysis, information systems development, 
management information systems, management systems, system design, systems 
analysis 



Automating the Technical Publications Department | 
John D. Browne 

January 1982 Proceedings of the 1st annual international conference on Systems 

documentation 
Publisher: ACM Press 

Full text available: ^ pdf(404.72 KB) Additional Information: full citation , abstract , index terms 

A DAY AT THE OFFICE— 1995 By the time George Denning arrives at his office at Seacom, 
Inc., at 10:30 a.m. this morning, he has already done three hours of work. George, a 
Senior Technical Writer In the Software Publications Departnnent, wanted sonne isolation 
this morning to edit a particularly tricky section in a user guide. His home computer, 
hooked up to Seacom's host computer through the local cable network, allowed him to edit 
the document on-line in the quiet of his den, ... 

The management of end-user computing: status and directions | 
James C. Brancheau, Carol V. Brown 

December 1993 ACM Computing Surveys (CSUR), volume 25 issue 4 
Publisher: ACM Press 

Full text available* 13 odfO 74 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

The development of computing applications by the people who have direct need for them 
in their work has become commonplace. During the 1980s, development of applications by- 
-end users" accelerated and became a key management and research concern. Known as 
"end-user computing/' the phenomena and research associated with this trend cross a 
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variety of disciplines. Tliis article critically surveys the published literature on end-user 
computing (EUC) management according t ... 

Keywords: desktop computing, end-user computing, information center, Information 
technology management, personal computing 



^'^ A web site design nnodel for financial information 
Michael Ettredge, Vernon J. Richardson, Susan Scholz 
November 2001 Communications of the ACM, Volume 44 issue ii 

Publisher: ACM Press 

Full text available: g| odfd 28.70 KB) , ♦ 

^ html(26.09 KB) A^^'*'^"®' Information: full citation , references , citings, index terms 



18 Documentation quality: The documentation of quality engineering: applying use 
cases to drive change in software engineering models 
Ashley Williams 

October 2004 Proceedings of the 22nd annual international conference on Design of 

communication: The engineering of quality documentation 
Publisher: ACM Press 

Full text available: ^ pdf(277.42 KB) Additional Information: full citation , abstract , references , index terms 

This paper exannlnes how documentation Is used to create "quality" engineering processes 
In software development, focusing on recent industry trends of adopting use case driven 
software engineering models, to Investigate a phenomenon that in this paper I call "genre - 
dumping." The paper aims to address questions about how software development methods 
change under a use-case driven model. For example, is it really that easy to adopt the use 
case methodology? The paper draws from a 24-month case ... 

Keywords: iterative methods, requirements engineering, rhetorical genre theory, 
specifications, text analysis, use cases, waterfall methods 




19 Ke y management and key exchange: A temporal key management scheme for 
secure broadcasting of XML documents 
Elisa Bertino, Barbara Carminati, Elena Ferrari 

November 2002 Proceedings of the 9th ACM conference on Computer and 
communications security 

Publisher: ACM Press 

Full text available: ^pdf(242.89 KB) Additional Information: full citation, abstract , references , index terms 

Secure broadcasting of web documents is becoming a crucial need for many web-based 
applications. Under the broadcast document dissemination strategy a web document 
source periodically broad-casts (portions of) its documents to a possibly large community 
of subjects, without the need of explicit subject requests. By secure broadcasting we mean 
that the delivery of information to sub-jects must obey the access control policies of the 
document source. Since different subjects may have the right to ... 

Keywords: XML, secure broadcasting, temporal key management 




20 Electronic signatures: a brief review of the literature 

Aashish Srivastava 

August 2006 Proceedings of the 8th international conference on Electronic 

commerce: The new e-commerce: innovations for conquering current 
barriers, obstacles and limitations to conducting successful business on 
the internet ICEC '06 
Publisher: ACM Press 
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Full text available: ^ pdf(260.60 KB) Additional Information: full citation , abstract , references , index terms 

The advent of the Internet saw technological innovations such as electronic signatures, in 
particular digital signatures, as an electronic equivalent to nnanuscript signatures in the 
online environment. However, the use of this technology Is still insignificant. The ainn of 
this paper is to review the various studies that have explored the technical and legal 
issues associated with electronic signatures and digital signatures with an objective to 
provide insights on their lack of acceptance. 

Keywords: complexity, cost, digital signatures, electronic signatures, lack of acceptance, 
legislation, liability, security 
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1 Proceedings of the SIGNUM conference on the programming environment for 

development of numerical software 
March 1979 ACM SIGNUM Newsletter, Volume 14 issue i 
Publisher: ACM Press 

Full text available: ^pdf(5.02 MB) Additional Information: full citation 



2 Frontmatter (TOC. Letters, Philosophy of computer science. Interviewers needed. 
^ Taking software requirements creation from folklore to analysis, SW components and 
product lines: from business to systems and technology. Software engineering 
survey) 

September 2005 ACM SIGSOFT Software Engineering Notes, Volume 30 issue 5 
Publisher: ACM Press 

Full text available: S pdf(1.98 MB) Additional Information: full citation , index terms 



^ Notable computer networks 

John S. Quarterman, Josiah C. Hoskins 

October 1986 Communications of the ACM, Volume 29 issue lo 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms , review 



Full text available: ' 



Computer networks are becoming more numerous and more diverse. Collectively, they 
constitute a worldwide metanetwork. 



^ Guidelines for selecting a financial modeling language 
Donald A. Heckerman, Ian J. Adams 

January 1978 Proceedings of the 10th conference on Winter simulation - Volume 1 
Publisher: IEEE Press 

Full text available: ^ pdf(440.66 KB) Additional Information: full citation , abstract , index terms 

The purpose of this paper is two-fold. First, to describe some of the different types of 
computer software which can be used to develop financial models, and to Indicate the 
advantages and disadvantages of each type. Second, to suggest a set of guidelines for 
relating the financial modeling needs of the user to the attributes of the numerous 
financial modeling languages which are currently available. The discussion and guidelines 
are based on Ernst & Ernsts experience In car ... 
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5 Information systems outsourcing: a survey and analysis of the literature 
Jens Dibbern, Tim Goles, Rudy Hirschheim, Bandula Jayatilaka 
November 2004 ACM SIGMIS Database, Volume 35 issue 4 

Publisher: ACM Press 

Full text available: ^ pdf(1.51 MB) Additional Information: full citation , abstract , references 

In the last fifteen years, academic researcli on information systems (IS) outsourcing has 
evolved rapidly. Indeed the field of outsourcing research has grown so fast that there has 
been scant opportunity for the research community to take a collective breath, and 
complete a global assessment of research activities to date. This paper seeks to address 
this need by exploring and synthesizing the academic literature on IS outsourcing. It offers 
a roadmap of the IS outsourcing literature, highligh ... 

Keywords: determinants, literature review, outcomes, outsourcing, relationships, 
research approaches, theoretical foundations 




Replication for web hosting systems | 
Swaminathan Sivasubramanian, Michal Szymaniak, Guillaume Pierre, Maarten van Steen 
September 2004 ACM Computing Surveys (CSUR), Volume 36 issue 3 
Publisher: ACM Press 

Full text available: ^ pdf(374.99 KB) Additional Information: full citation , abstract , references , index terms 

Replication is a well-known technique to improve the accessibility of Web sites. It generally 
offers reduced client latencies and increases a site's availability. However, applying 
replication techniques is not trivial, and various Content Delivery Networks (CDNs) have 
been created to facilitate replication for digital content providers. The success of these 
CDNs has triggered further research efforts into developing advanced <i>Web replica 
hosting systems </i>. These are systems that ... 

Keywords: Web replication, content delivery networks 



Innovative Document Systenns: TabulaMagica: an integrated approach to manage 
complex tables 
Horst Siiberhorn 

November 2001 Proceedings of the 2001 ACM Symposium on Document engineering 
Publisher: ACM Press 

Full text available: ^ pdfd 38.97 KB) Additional Information: full citation , abstract , references , index terms 

Tables are a special part of documents and specific means have been developed to 
manage them. Step by step, the underlying models to edit and format tables have been 
improved or supplemented by new ones. These models led to a wide variety of table 
formats and produced "tabular legacies", making it difficult to edit, use, or modify tables in 
varying formats. It is even more time-consuming to convert them for various media or to 
unify or compare tabular information. Our approach to tackle these pr ... 

Keywords: WYSIWYG editor, separation of structure and presentation, table processing, 
tabular legacies 



8 A unified approach to test data analysis 
Michael A. Gianfagna 

June 1978 Proceedings of the 15th conference on Design automation 
Publisher: IEEE Press 

Full text available- IB Ddf(581 18 KB) Additional Information: full citation , abstract , references, citings, index 

'' terms 

To provide cost-effective performance evaluation or engineering feedback from circuit test 
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results often requires that complex analyses be performed on large volumes of non- 
standard data. Using a large scale data management system and a modular design 
philosophy, a system to cope with the above requirements has been developed. TDAS 
(Test Data Analysis System) has provided timely and economic solutions to test data 
analysis problems which might have been intractable by other means. 

9 Real-time shading Q 
^ Marc Olano, Kurt Akeley, John C. Hart, Wolfgang Heidrich, Michael McCool, Jason L Mitchell, 
>^ Randi Rost 

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes 
SIGGRAPH '04 

Publisher: ACM Press 

Full text available: ^ pclf(7.39MB) Additional Information: full citation, abstract 

Real-time procedural shading was once seen as a distant dream. When the first version of 
this course was offered four years ago, real-time shading was possible, but only with one- 
of-a-kind hardware or by combining the effects of tens to hundreds of rendering passes. 
Today, almost every new computer comes with graphics hardware capable of interactively 
executing shaders of thousands to tens of thousands of instructions. This course has been 
redesigned to address today's real-time shading capabili ... 

''O Fast detection of communication patterns in distributed executions Q 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced 

Studies on Collaborative research 
Publisher: IBM Press 

Full text available: ^ pdf(4.21 MB) Additional Information: full citation , abstract , references, index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-tinne diagrams are often used to obtain a better understanding of the execution, 
of the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not 
provide the user with the desired overview of the application. In our experience, such tools 
display repeated occurrences of non-trivial commun ... 
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Traditional database systems provide a user with the ability to query and manipulate one 
database state, namely the current database state. However, in several emerging 
applications, the ability to analyze "what-if" scenarios in order to reason about the impact 
of an update (before committing that update) is of paramount importance. Example 
applications include hypothetical database access, active database management systems, 
and version management, to name a few. The central th ... 

Keywords: active databases, deltas, execution model for rule application, hypothetical 
access, hypothetical database state 
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The practice of outsourcing among organizations frequently involves external companies or 
consultants introducing texts (or In rhetorical genre theory terms, genres) as means of 
transforming work practices in the company who sought expert help from the outside. 
Such an abrupt-seeming introduction of unfamiliar texts upon workers, either within or 
across organizations, characterizes a practice I call "genre dumping." This practice, 
however, contrasts with the rhetorical genre theory perspective t ... 
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This article describes four case studies in which requirements for new flight software 
subsystems on NASA's Space Shuttle were analyzed using mechanically supported formal 
methods. Three of the studies used standard formal specification and verification 
techniques, and the fourth used state exploration. These applications illustrate two thesis: 
(1) formal methods complement conventional requirements analysis processes effectively 
and (2) formal methods confer benefits even when only selecti ... 
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We describe a system for rapidly determining document similarity among a set of 
documents obtained from an information retrieval (IR) system. We obtain a ranked list of 
the most important terms in each document using a rapid phrase recognizer system. We 
store these in a database and compute document similarity using a simple database 
query. If the number of terms found to not be contained in both documents Is less than 
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In the fall of 1978 we decided to produce a special issue of the SIGART Newsletter devoted 
to a survey of current knowledge representation research. We felt that there were twe 
useful functions such an issue could serve. First, we hoped to elicit a clear picture of how 
people working in this subdiscipline understand knowledge representation research, to 
Illuminate the issues on which current research is focused, and to catalogue what 
approaches and techniques are currently being developed. Secon ... 
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A new system architecture shares certain characteristics with database systems, expert 
systems, functional programming languages, and spreadsheet systems, but is very 
different from any of these. 

Inverted files for text search engines 
Justin Zobel, Alistair Moffat 

July 2006 ACM Computing Surveys (CSUR), Volume 38 issue 2 
Publisher: ACM Press 

Full text available: ^ pdff944.29 KB) Additional Information: full citation , abstract, references , index terms 

The technology underlying text search engines has advanced dramatically In the past 
decade. The development of a family of new index representations has led to a wide range 
of Innovations in index storage, index construction, and query evaluation. While some of 
these developments have been consolidated in textbooks, many specific techniques are 
not widely known or the textbook descriptions are out of date. In this tutorial, we 
introduce the key techniques in the area, describing both a core impi ... 
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